Demographer: Extremely Simple Name Demographics

Authors

  • Rebecca Knowles
  • Josh Carroll
  • Mark Dredze
Abstract

The lack of demographic information available when conducting passive analysis of social media content can make it difficult to compare results to traditional survey results. We present DEMOGRAPHER, a tool that predicts gender from names, using name lists and a classifier with simple character-level features. By relying only on a name, our tool can make predictions even without extensive user-authored content. We compare DEMOGRAPHER to other available tools and discuss differences in performance. In particular, we show that DEMOGRAPHER performs well on Twitter data, making it useful for simple and rapid social media demographic inference.
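The abstract's combination of name lists and character-level features can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the function names, the n-gram range, the lookup-before-classifier order, and the toy name list and weights are all assumptions made for the example.

```python
from collections import Counter


def char_ngram_features(name, n_max=3):
    """Count character n-grams (n = 1..n_max) of a lowercased name.

    '^' and '$' mark the start and end of the name so that prefixes
    and suffixes become distinct features (an assumed design choice).
    """
    padded = "^" + name.strip().lower() + "$"
    feats = Counter()
    for n in range(1, n_max + 1):
        for i in range(len(padded) - n + 1):
            feats[padded[i:i + n]] += 1
    return feats


def predict(name, name_list, weights, default="unknown"):
    """Check the name list first; otherwise fall back to a linear score
    over character n-gram counts (hypothetical fallback order)."""
    key = name.strip().lower()
    if key in name_list:
        return name_list[key]
    feats = char_ngram_features(key)
    score = sum(weights.get(f, 0.0) * count for f, count in feats.items())
    if score > 0:
        return "F"
    if score < 0:
        return "M"
    return default


# Toy, hand-written inputs purely for illustration; a real system would
# use large name lists and weights learned from labeled data.
NAME_LIST = {"rebecca": "F", "mark": "M"}
WEIGHTS = {"a$": 1.0, "k$": -1.0}
```

With these toy inputs, `predict("Rebecca", NAME_LIST, WEIGHTS)` is answered from the name list, while a name absent from the list is scored only from its character n-grams, which is what lets a name-only tool make a prediction without any user-authored content.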

Similar resources

Working Hard for the Money: Trends in Women's Employment 1970 to 2007. Kristin Smith. Reports on Rural America

the Neil and Louise Tillotson Fund of the New Hampshire Charitable Foundation. Family Demographer, The Carsey Institute, University of New Hampshire

The curiously misunderstood role of evidence in designing new technology.

"Flying safely has not developed using experimental and control groups of air passengers and counting victims." (Ülo Kristjuhan) Recently, I took part in a radio interview together with the demographer Jay Olshansky. Jay and I have been friends for over a decade, and we have done this before: indeed, he and I have for most of that period been among the most frequently appearing academic g...

Identifying Participants in the Personal Genome Project by Name (A Re-identification Experiment)

We linked names and contact information to publicly available profiles in the Personal Genome Project. These profiles contain medical and genomic information, including details about medications, procedures and diseases, and demographic information, such as date of birth, gender, and postal code. By linking demographics to public records such as voter lists, and mining for names hidden in attac...

Author name disambiguation: What difference does it make in author-based citation analysis?

In this paper, we explore how strongly author name disambiguation (AND) affects the results of an author-based citation analysis study, and identify conditions under which the commonly used simplified approach of using surnames and first initials may suffice in practice. We compare author citation ranking and co-citation mapping results in the stem cell research field 2004-2009 between two AND ...

Path ORAM: An Extremely Simple Oblivious RAM Protocol

We present Path ORAM, an extremely simple Oblivious RAM protocol with a small amount of client storage. Partly due to its simplicity, Path ORAM is the most practical ORAM scheme known to date with small client storage. We formally prove that Path ORAM has a O(logN) bandwidth cost for blocks of size B = Ω(logN) bits. For such block sizes, Path ORAM is asymptotically better than the best known OR...

Publication date: 2016